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A NONPARAMETRIC INDEPENDENCE TEST USING 
RANDOM PERMUTATIONS * 

By Jesus E. GarciaT'* and Veronica A. Gonzalez-Lopez"!''^ 
Universidade Estadual de Campinas 

We propose a new nonparametric test for the supposition of inde- 
pendence between two continuous random variables X and Y. Given 
a sample of {X,Y), the test is based on the size of the longest in- 
creasing subsequence of the permutation which maps the ranks of 
the X observations to the ranks of the Y observations. We identify 
the independence assumption between the two continuous variables 
with the space of permutation equipped with the uniform distribu- 
tion and we show the exact distribution of the statistic. We calculate 
the distribution for several sample sizes. Through a simulation study 
we estimate the power of our test for diverse alternative hypothesis 
under the null hypothesis of independence. 

1. Introduction. Let {X,Y) be a random vector of continuous vari- 
ables witli unknown joint cumulative distribution H and univariate marginal 
distributions F and G. Call O the space of the univariate, cumulative and 
continuous distributions, then F,G £ 0,. 

Suppose that (xi, yi), • • • , {xn,yn) is a paired sample of size n of {X, Y). 
We want to test the hypothesis 

(1.1) //o : -'^ and y are independent. 

A test is constructed with no extra assumption (other than continuity) about 
the form of the marginal distributions. Let rank{xi) {rank{yi)) be the po- 
sition occupied by Xi (jji) in the sample {xj}^-^^ ({yj}j=i)) the test statistic 
depends on the rank order of the observations. The procedure is based on 
the size of the longest increasing subsequence of the random permutation 
defined by the paired samples. 
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The power of our test is compared with those of various existing tests by sim- 
ulation. Four independence tests are selected for this comparative process, 
namely, Pearson test, Kendall test. Spearman test and Hoeffding test. One of 
them is parametric, the Pearson test, selected by its well known performance 
in the normal case and the other three are nonparametric. Each methodol- 
ogy estimates the association between paired samples and computes a test 
of the value being zero. They use different measures of association, all of 
them in the interval [—1,1] with indicating no association (depending on 
the test's formulation). 

In our simulations, the Hoeffding test has a better power but at the expense 
of not controlling the significance level. In general lines, in the independent 
non normal marginals case, for moderate sample size, our test is the only one 
respecting the significance level. On the other hand, in the dependent case, 
the performance of our test depends on the joint distribution. Assuming nor- 
mal joint distribution and linear dependence between the normal random 
variables, our test performs a lower power compared to the other tests which 
are designed for that case. For the case in which the joint distribution is not 
normal, we performed a simulation study with different conditions. For ex- 
ample, we use a mixture of bivariate normal distributions on the random 
variables. In that case our procedure was competitive and more powerful 
than the other four tests considered. In these simulations just our procedure 
and Hoeffding had a power function going to 1 when the sample size grows. 
Section 2 is devoted to motivate the proposal and provides the main con- 
cepts and the definition of the test statistic. In Section 3 we show how to 
calculate the exact distribution and the asymptotic distribution of the test 
statistic. In Section 4 we show the effectivity of our proposal, using simula- 
tions and we discuss the results. The Appendix A contains a brief overview 
of the tests that we use to compare with our proposal. 

2. Nondecreasing (nonincreasing) subsets. In order to highlight 
the relationship between the values observed of X and Y, we can plot X 
versus Y, detecting in some specific cases evidence about the form of the func- 
tion g connecting the variables. In this way the functional relation Y = g{X) 
could be established. The specification of the form of g is in general a hard 
task. Instead of looking for g directly, we can ask for which kind of ran- 
dom bivariate distribution H assures that Y is almost surely an increasing 
(or decreasing) function of X. The answer is independent of the marginal 
distributions F and G, if F and G are in J7. 

2.1. Perfect dependence. 
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Definition 2.1. IfM= [-00, +00] and = R x R, 

1. a subset S 0/ R^ is nondecreasing if for any {x,y) and {u,v) in S, 
X < u implies y < v (see figure 1); 
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Fig I . The graph of a nondecreasing set. 

2. a subset S of R^ is nonincreasing if for any {x, y) and (n, v) in S, 
X < u implies y > v. 

We refer to 

(2.1) H{x,y)=mm{F{x),G{y)} 

(2.2) H{x, y) = max {0, F{x) + G{y) - 1} 

as the Frechet upper bound and the Frechet lower bound respectively. 

The next theorem establishes that H is identically equal to its Frechet 
upper (lower) bounds if and only if the support of H is concentrated on a 
nondecreasing (nonincreasing) subset. 

Theorem 2.1. Mikusinski et al. [10]. Let be H the joint distribution of a 
pair X, Y of random variables whose one dimensional distribution functions 
are F and G, respectively. Then, 

1. H{x,y) is identically equal to (2.1) if and only if {X,Y) lies almost 
surely in a nondecreasing subset o/R^; 

2. H{x,y) is identically equal to (2.2) if and only if {X,Y) lies almost 
surely in a nonincreasing subset o/R^. 
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If X and Y are continuous, the support of H can have no vertical or 
horizontal lines. When H{x,y) is given by (2.1) (or (2.2)) X and Y are 
continuous, Y is almost surely an increasing (decreasing) function of X. 
Every kind of dependence is found between two pure cases of dependence, 
monotone nonincreasing and monotone nondecreasing as showed by the next 
proposition. 

Proposition 2.1. Nelsen [11]. Consider the same hypotheses as in The- 
orem 2.1 . Then, 

1. max{0,F(x) + G(y) - 1} < i7(x,y) < min{F(x),G(y)} Vx,i/G]R; 

2. max {0, u + w — 1} < C(u, v) < min {u, v} , u,v £ [0, 1] , where C is a 
cumulative distribution (or copula) such that H{x,y) = C{F{x), G{y)). 

Remark 2.1. As showed in the last result, the dependence between X 
and Y is exposed transforming the variables X and Y by the marginal cumu- 
lative F and G, respectively. Under the continuous marginal suppositions, if 
H{x,y) is given by (2.1) (or (2.2)), then P{U = V) = 1 {or P{U = 1-^) = 
1), where U = F{X) and V = G{Y). 

In conclusion, one way to expose the dependence, with little informa- 
tion about the marginal behavior of X and Y, is to use the empirical 
marginal distribution where each marginal observation Xi{yi) is replaced 

1 rank{xi) f rankjyi) n 



Our proposal consists on showing a specific relationship between X and Y 
which makes easy to measure the independence between them. We show the 
relation induced by the empirical copula, replacing the original observations 
by its marginal ranks and we find the longest increasing subsequence defined 
by the graphic of the marginal ranks. First, we note that the distribution 
of the statistic given by the longest increasing subsequence is known under 
the assumption of independence. Second, the longest increasing subsequence 
exposes the tendency of the data to accumulate points into the increasing 
subset defined by the longest increasing subsequence. 

2.2. Construction of a nondecreasing subset using the sample. We con- 
nect the sample with a specific permutation of n points tt^, this permutation 
defines the nondecreasing subset that we use. We explain the procedure us- 
ing the next warm-up example. 

Example 2.1. Let us consider the random sample s, 



n 



n 



{(4.16,3. 



25), (1.15, 3. 



,5), (2.51,4.17), (3.61, 3. 



18), (1.81,2.86)}. 
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First, sort the samples {{xi,yi)}^^i in increasing order in relation to the 
sample {xj}"^^ and replace the Xi value with its rank in the sequence, on our 
example this produces {(1, 3.5), (2, 2.86), (3, 4.17), (4, 3.18), (5, 3.25)} . Next, 
replace each yi with its rank in the {yijiLi sequence, on our example this pro- 
duces {(1, 4), (2, 1), (3, 5), (4, 2), (5, 3)} . The permutation tTs related to this 
sample is defined by 

TTs{l) = 4, ^,(2) = 1, 7r,(3) = 5, 7r,(4) = 2, 7r,(5) = 3. 




Empirical Cumulalive X 



Fig 2. Dispersion's graphic and permutation (Example 2.1). (a) is the dispersion plot for 
the sample, (b) represents the permutation defined by the sample, the black line shows the 
longest increasing subsequence, (c) shows the empirical copula of the sample. 

On this example the longest increasing subsequence is {1,2,3} see figure 
2(b). 

Our test is based on the distribution of the size of the longest increasing 
subsequence of a random permutation of n points, assuming uniform distri- 
bution on the random permutation space. 

Formally, 

Definition 2.2. Let Sn denote the group of permutations o/{l, • • • , nj . 
If TT £ Sn, we say that vr(ii), • • • , 7r(ifc) is an increasing subsequence in vr if 
1 < ii < • • • < ifc < n and 1 < vr(ii) < 71(12) < ■ ■ < 7r(ifc) < n. 

Definition 2.3. Given a random permutation vr G Sn, 
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1. we call Ln{TT) the length of the longest increasing subsequence o/vr; 

2. we call LDn^n) the length of the longest decreasing subsequence o/vr. 

Example 2.2. Consider the set {1, 2, 3, 4, 5, 6, 7, 8} . Let n be the permu- 
tation which transforms the previous set in {3, 6,1,7, 4, 2, 5, 8} where 7r(l) = 
3,7r(2) = 6,7r(3) = l,7r(4) = 5,7r(5) = 7,7r(6) = 2,7r(7) = 4,7r(8) = 8. Ex- 
amples of increasing subsequences are {1,7,8}, {3,6,7,8}, {1,2,5,8}. The 
maximal size for the increasing subsequences is 4 which is reached by the 
sequences {1, 2, 5, 8}, {1,4, 5, 8} and {3, 6, 7, 8}, then Lsiir) = 4. 

Example 2.3. (continued). On the Example 2.1 the longest increasing 
subsequence is {1,2,3} and the value L^{tTs) = 3, see figure 2. 

On the next section we study the distribution of the length of the longest 
increasing subsequence, under the assumption of independence between X 
and Y. 

3. The longest increasing subsequence. Let Sn denote the group 
of permutations of { 1 , • • • , n} and equip Sn with the uniform distribution, 
for A; = 1, 2 • • • , n, we define, 



We denote 3.1 briefly by p^. 

Under the independence hypothesis for the random variables X and Y, every 
possible permutation defined by a random sample of size n, {{xi,yi)}^^i, 
has the same probability 1/n!. Using this fact, the Young tableaux, the 
Schensted theorem by Schensted [13] and the ZS2 algorithm by Zoghbi et 
al. [14], the probabilities could be calculated for each finite n and and k 
with 1 < k < n. 

3.1. The exact distribution of Ln in the case of independence. We will 
touch only a few aspects of the theory, just the necessary in order to show 
how to calculate the distribution. Firstly, we introduce the same concepts. 

Definition 3.1. A standard Young Tableau of order n is an arrange- 
ment of n distinct natural numbers in rows and columns so that the numbers 
in each row and in each column form increasing sequences, and so that there 
is an element of each row in the first column and an element of each column 
in the first row, and there are no gaps between numbers. 



(3.1) 



P{Ln = k) 



7^ { of permutations vr G 5„ : Lniir) = k} 



nl 
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The first row on the standard Young Tableau corresponds to one of the 
longest increasing subsequences. We can construct the Young tableaux com- 
posed by the increasing subsequences originated by some specific permuta- 
tion, as showed in the next example. 

Example 3.1. (continued). For the permutation on Example 2.1 the 
Young tableau is, 

1 2 3 

4 5 

Definition 3.2. If T is a standard Young tableau of order n, for each 
element j, j £ {!,••• ,n} of the arrangement we define the Hook number 
of j as the number of elements in the same column and in the same row in 
which j is included. Counting from the bottom until the element j and from 
the right to the row until the element j. 

Example 3.2. (continued). For the standard Young tableau in Example 
3.1 the Hooks numbers are, 

4 3 1 

2 1 

Remark 3.1. The Hook numbers depend on the form of the Tableau 
not on the numbers filling it. Different permutations of {!,■ ■ ■ ,n} can give 
the same Tableau shape, so, each permutation is directly associated with the 
shape of a Young tableau. 

The next example shows all the possible shapes of the Young tableaux 
that can be obtained by the permutations of 5 numbers. 



Example 3.3. The complete list of shapes and Hooks numbers in each 
shape, admitted by the numbers {1,2,3,4,5} follows in the next table. Each 
element of this list is associated with an integer partition (IP) o/n = 5. 



Shape 1 


Shape 2 


Shape 3 


Shape 4 


Shape 5 


Shape 6 


Shape 7 


5 


51 


52 


521 


431 


5321 


54321 


4 


3 


31 


2 


21 


1 




3 


2 


1 


1 








2 
1 


1 












IP 1(5) 


IP2(5) 


1P3(5) 


IP 4 (5) 


1P5(5) 


IP6(5) 


IP7(5) 


5 


4+1 


3+2 


3+1+1 


2+2+1 


2+1+1+1 


1+1+1+1+1 
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Shape 1 corresponds to the permutation 7r(l) = 5, 7r(2) = 4, vr(3) = 3, 7r(4) = 
2,7r(5) = 1 (LD^ = 5j and it is associated to the integer partition of 
n = 5, IP1(5)=5 (the sum of the number of elements in the first column 
of the shape 1). The shape 5 is associated to the integer partition of n = 5, 
IP5(5)=2+2+l, where each term (from left to right) is the sum of the num- 
ber of elements by column in the shape 5. 

In order to calculate all the possible shapes of Young tableaux of size n, 
having k columns and m rows we use an algorithm which finds these forms 
(or the integer partitions) in an efficient way. 

Given a permutation tt, the size of the longest increasing subsequences 
for vr is the size of the first row in the shape of the Tableaux corresponding 
to the permutation. In other words, the number of permutations vr of n 
numbers such that L{tt) = k, is the number of Young tableaux with a shape 
such that the first row has size k. The number of standard Young tableaux 
with a given shape can be efficiently computed using the following theorem 
by Frame et al. [5]. 

Theorem 3.1. Frame et al. [5] . The number of standard Young tableaux 
with a given shape, containing the integers {1, • • • ,n} is -prn"' where the 

hj,j = l,---,n are the Hook numbers associated with the cells of the Tableau. 

Example 3.4. (continued). The number of standard Young tableaux con- 
taining the numbers {1,2,3,4,5} with shape given by the Example 3.2 is 
5!/[4.3.2] = 5. 

The number of sequences of size n with a longest increasing subsequence 
of size k and longest decreasing subsequence of length m can be calculated 
using the result given by Schensted [13]. 

Theorem 3.2. Schensted [13]. The number of sequences consisting of 
the numbers {1, • • • , n} and having a longest increasing subsequence of length 
k and longest decreasing subsequence of length m, is the sum of the squares of 
the number of standard Young tableaux of identical shape, having k columns 
and m rows. 

Example 3.5. (continued from example 3.3). Considering the numbers 
{1,2,3,4,5} we want to calculate the number of sequences having L5 = 3. 
Let us denote by # {A} the cardinal of A, where A is some set. 
# {L5 = 3} = # {L5 = 3, LD^ = 2} + # {L5 = 3, LD5 = 3} , corresponding 
with only two possible shapes of Young tableaux, with Hook numbers given 
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by the example 3.3, shape 4 o,nd shape 5. Using the Schensted Theorem, 
# {L5 = 3, LDs = 2} = 52 = 25, # {L5 = 3, LDs = 3} = 6^ = 36 and 
#{L5 = 3} = 25 + 36 = 61. 

For a given shape W, we call N{W) the number of standard Young 
tableaux with that shape as given by Theorem 3.1. Let Vn{k,m) be the set 
of shapes of Young tableaux of size n having k columns and m rows. From 
Theorem 3.2, we have that the number of permutations of n elements with a 
longest increasing subsequence of size k and longest decreasing subsequence 
of length m is, 

VFey„(A:,m) 

and the number of permutations of n elements with a longest increasing 
subsequence of size k is 

n 

k=l W£V„{k,m) 

SO we have the following theorem. 

Theorem 3.3. Let Sn denote the group of permutations o/{l,-- - ,n} 
with the uniform distribution. Let Ln^n) be given by definition 2.3 and 
p^, k = 1, - ■ ■ ,n given by the equation 3.1. Then, 

1 " 

(3-2) p^^ = v^T. E 

• k=l W&V„{k,m) 

There are diverse algorithms in the literature to find Vn{k,m), we imple- 
mented the ZS2 algorithm by Zoghbi et al. [14] . 

Using Theorem 3.3 we compute p^ for 1 < k < n, n = 1, ■ ■ ■ , 100. The 
table can be accessed from our R package LIStest. 

3.2. The asymptotic distribution of Ln in the case of independence. The 
asymptotic distribution for random permutations, after appropriate center- 
ing and scaling, was first obtained by Baik et al. [3], as shows the next 
theorem. Let q{z) denote the solution of the Painleve II equation given by, 

Qzz = 2(7^ + zq, satisfying the boundary condition 
q{z) ~ Ai{z) when z — > 00 
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where Ai is the Airy function. Hastings et al. [6] show the asymptotic solu- 
tions, 

g-(4/3).«/2 

q{z) = -Ai{z) + 0( — ) as z ^ oo, 



l(^) = -\l + 0{-2)) as z ^ -oo. 



Now, we define the Tracy- Widom distribution by the next cumulative 
distribution, 

(3.3) FTw{t) = exp ( - ^ {z- t)q^{z)dz^ , t G R. 

Theorem 3.4. Baik et al. [3]. Let Sn denote the group of permutations 
o/{l, • • • ,n} with the uniform distribution. Let ^^(vr) he given by definition 
2.3. Let X a random variable whose distribution function is Ftwi given 
by equation 3.3. Then, as n ^ oo, 



Ln — 2v/n 
\v = 777 > Y m distribution. 

We calculate the quantiles of the Tracy Widom distribution, using the S- 
plus code available in http:/ /www.vitrum.md/andrew/MScWrwck/codes.txt. 
See table 1 for a few values. 

Table 1 

Quantiles for the Tracy- Widom distribution 



a 


a/2 quantile 


(1 — a/2) quantile 


0.001 


-4.44025 


1.54089 


0.01 


-3.91393 


0.74618 


0.05 


-3.44277 


0.09153 



3.3. The Ln test of independence. Let (xi,yi), • • • , {xn,yn) be a paired 
sample of size n of {X, Y) , where X and Y are continuous random variables 
with cumulative marginal F and G respectively; F, G £ 0,. A test for inde- 
pendence can be carried out, as pointed in this section. 
The two-sided statistical tests and P-values are well defined when the test 
statistic has a symmetric distribution, which is not our case. For the asym- 
metric case, the most recent contributions include several proposals. We 
choose to use the doubled two sided P- value because it appears to be simple 
as a starting point. 
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Definition 3.3. The doubled two-sided P-value is given by, 

(3.4) min {2FlMI{Io<Mo} + 2(1 - i^L„(/o))/{/o>Afo}. l} 

where Iq is the observed value of Ln in the sample, F^^ is the cumulative 
distribution function, Fl^{Iq) = X)fc=iPfc T-^ee equation 3.2) and Mq is the 
mode of the distribution. Ie denotes the indicator function of E. 

The previous definition was used for n = 1, • • • , 100. 

For n > 100 we use tlie asymptotic distribution of Ln (see equation 3.3) 
and the quantiles from table 1. If a E (0, 1) is the level of significance, we 
reject the hypothesis of independence, 1.1 if ^°~^"g — < qa/2 or ^°~^"g — > 
9(i-o/2)) where is the 7 quantile of Ftw- 

4. Simulation. To compare the power of our test against the Hoeffd- 
ing, Kendall, Pearson and Spearman test (see Appendix A), we carried out 
a simulation study in which for each test we estimate the power function 
for different sample sizes and diverse joint distributions. For each joint dis- 
tribution and sample sizes 20,40,60,80,100,500,1000 we simulated 10000 
samples, and computed the P- values. 

4.1. Independence. The independence case was tested in several situa- 
tions. We analize pairs of independent random variables with standard nor- 
mal marginal distributions, Pareto marginal distributions, WeibuU marginal 
distributions and Student-t marginal distributions. 

Figure 3 shows the behavior of the empirical power functions, under in- 
dependence when X and Y have standard normal marginals. The power 
function of the statistic L„ (equation 3.1, equation 3.2) is compared with 
other power functions, given by Hoeffding, Pearson, Spearman and Kendall 
test. The power function of our test is smaller than the significance level, we 
can see also how the empirical power function for the Hoeffding test is not 
lower than the significance level. Table 2 shows the power for level 0.05. 

Figure 4 shows the behavior of the empirical power functions, when X 
and Y have independent Pareto marginals, with parameters of scale equal 
to 1; shape parameter equal to 0.25 for the picture on the left and shape 
parameter equal to 4, for the picture on the right. For sample sizes going from 
20 to 100, we can see that the only statistic with empirical power constantly 
lower than the level 0.01 is the L„. We can see also that both, Pearson 
and Hoeffding tests can have empirical powers higher than 4 times the level 
0.01. The other tests do not respect the significance levels for those sample 
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20 40 60 80 100 



Fig 3. Sample size vs. empirical power function at level 0.01 in the case: independent 
N{0, 1) random variables. Hoeffding ("h" in red); Pearson ("p" in green); Spearman ("r" 
in sky); Kendall ("k" in blue); Ln ("I" in black). 



Spearman Kendall Hoeffding Pearson L. 



10 


0.051 


0.049 


0.110 


0.050 


0.022 


20 


0.047 


0.044 


0.070 


0.048 


0.012 


30 


0.048 


0.049 


0.069 


0.047 


0.016 


40 


0.058 


0.056 


0.072 


0.055 


0.016 


50 


0.051 


0.051 


0.062 


0.049 


0.025 


60 


0.048 


0.048 


0.060 


0.046 


0.008 


70 


0.057 


0.054 


0.063 


0.058 


0.012 


80 


0.053 


0.052 


0.057 


0.055 


0.023 


90 


0.046 


0.049 


0.055 


0.049 


0.025 


100 


0.047 


0.048 


0.054 


0.049 


0.021 



Table 2 

Empirical power function at level 0.05. Independent N{0, 1) case. 



sizes. A similar behavior can be seen in figure 5 under WeibuU marginal 
distributions and under t-student marginal distributions in figure 6. This 
behavior can be seen in more details and for larger sample sizes on table 3 
for level 0.01 and table 4 for level 0.05. 

Figure 5 shows the empirical power functions assuming a Weibull for each 
variable with scale parameter equal to 1 and shape parameter equal to 0.25 
on the left and on the right, the scale parameter is equal to 1 and the shape 
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20 40 60 80 100 20 40 60 80 100 



Fig 4. Sample size vs. empirical power Function at level 0.01 in the case of independent 
Pareto random variables with parameters (1, 0.25) and (1,4) for the left and right figure re- 
spectively. Hoeffding ("h" in red); Pearson ("p" tn green); Spearman ("r" in sky); Kendall 
("k" in blue); L„ ("I" in black). 




20 40 60 80 100 20 40 60 80 100 



Fig 5. Sample size vs. empirical power Function at level 0.01 in the case of independent 
Weibull random variables with parameters (1,0.25) and (1,2) for the left and right fig- 
ure respectively. Hoeffdtng ("h" in red); Pearson ("p" in green); Spearman ("r" in sky); 
Kendall ("k" in blue); L„ ("I" m black). 



parameter is equal to 2. Figure 6 shows the empirical power functions under 
Student-t marginals with one degree of freedom on the left and 16 degrees 
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Distribution 


Test 


20 


40 


60 


80 


100 


500 


1000 




Spe 


0.011 


0.011 


0.010 


0.009 


0.010 


0.010 


0.008 




Ken 


0.010 


0.011 


0.010 


0.009 


0.009 


0.010 


0.008 


Pareto(l,0.25) 


Hoe 


0.023 


0.017 


0.013 


0.013 


0.012 


0.009 


0.010 




Pea 


0.051 


0.032 


0.025 


0.019 


0.017 


0.003 


0.003 




Ln 


0.001 


0.002 


0.001 


0.004 


0.005 


0.007 


0.007 




Spe 


0.009 


0.012 


0.010 


0.010 


0.010 


0.009 


0.011 




Ken 


0.009 


0.011 


0.010 


0.010 


0.010 


0.009 


0.011 


Pareto(l,4) 


Hoe 


0.024 


0.017 


0.014 


0.012 


0.011 


0.010 


0.012 




Pea 


0.023 


0.021 


0.018 


0.021 


0.019 


0.018 


0.017 




Ln 


0.001 


0.002 


0.002 


0.003 


0.005 


0.005 


0.008 




Spe 


0.009 


0.011 


0.010 


0.009 


0.010 


0.011 


0.010 




Ken 


0.008 


0.010 


0.011 


0.009 


0.009 


0.011 


0.010 


Weibull(l,0.25) 


Hoe 


0.021 


0.016 


0.013 


0.012 


0.010 


0.011 


0.008 




Pea 


0.015 


0.012 


0.012 


0.011 


0.012 


0.011 


0.011 




Ln 


0.001 


0.002 


0.002 


0.004 


0.005 


0.008 


0.004 




Spe 


0.009 


0.011 


0.010 


0.011 


0.010 


0.011 


0.010 




Ken 


0.008 


0.010 


0.010 


0.010 


0.010 


0.011 


0.010 


Weibull(l,2) 


Hoe 


0.022 


0.016 


0.015 


0.013 


0.012 


0.012 


0.010 




Pea 


0.013 


0.012 


0.012 


0.012 


0.012 


0.011 


0.010 




Ln 


0.001 


0.001 


0.002 


0.003 


0.005 


0.007 


0.005 




Spe 


0.010 


0.010 


0.010 


0.010 


0.009 


0.011 


0.011 




Ken 


0.009 


0.009 


0.009 


0.010 


0.009 


0.011 


0.011 


Student-t(l) 


Hoe 


0.023 


0.016 


0.013 


0.012 


0.013 


0.010 


0.011 




Pea 


0.037 


0.037 


0.034 


0.031 


0.028 


0.021 


0.012 




Ln 


0.001 


0.001 


0.002 


0.003 


0.006 


0.006 


0.006 




Spe 


0.009 


0.010 


0.009 


0.009 


0.011 


0.011 


0.009 




Ken 


0.008 


0.010 


0.009 


0.009 


0.011 


0.011 


0.009 


Student-t(16) 


Hoe 


0.021 


0.015 


0.013 


0.011 


0.013 


0.011 


0.009 




Pea 


0.009 


0.011 


0.010 


0.009 


0.010 


0.008 


0.008 




Ln 


0.001 


0.001 


0.002 


0.004 


0.006 


0.007 


0.005 



Table 3 



Empirical power function at level 0.01, for pairs of i.i.d. random variables. 



of freedom on the right. 

4.2. Dependence. Figure 7 shows the power functions at level 0.01 in the 
bivariate case, with standard normal marginal distributions and correlation 
coefficient equal to 0.7. As is expected for the normal case, the Hoeffding 
and Pearson tests have the highest power functions. Table 5 shows the power 
for level 0.05. 

Figure 8 shows the power empirical functions at level 0.01 in the case of 
a mixture (50-50) of two bivariate with standard normal marginal distribu- 
tions one with positive correlation 0.7 and the other with negative correlation 
—0.7. In this case, the Ln test has the highest power function. Table 7 shows 
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Distribution 


Test 


20 


40 


60 


80 


100 


500 


1000 




Spe 


0.051 


0.051 


0.051 


0.051 


0.051 


0.053 


0.046 




Ken 


0.047 


0.049 


0.050 


0.051 


0.050 


0.053 


0.046 


Pareto(l,0.25) 


Hoe 


0.079 


0.065 


0.060 


0.056 


0.057 


0.054 


0.046 




Pea 


0.058 


0.037 


0.028 


0.022 


0.019 


0.004 


0.003 




Ln 


0.019 


0.024 


0.014 


0.022 


0.028 


0.046 


0.038 




Spe 


0.051 


0.050 


0.052 


0.049 


0.053 


0.048 


0.054 




Ken 


0.049 


0.049 


0.052 


0.049 


0.052 


0.048 


0.054 


Pareto(l,4) 


Hoe 


0.083 


0.065 


0.060 


0.054 


0.058 


0.048 


0.055 




Pea 


0.056 


0.051 


0.050 


0.049 


0.047 


0.048 


0.052 




Ln 


0.019 


0.023 


0.015 


0.021 


0.031 


0.048 


0.038 




Spe 


0.047 


0.051 


0.049 


0.051 


0.050 


0.051 


0.049 




Ken 


0.043 


0.049 


0.050 


0.052 


0.050 


0.052 


0.048 


Weibull(l,0.25) 


Hoe 


0.077 


0.065 


0.058 


0.056 


0.054 


0.050 


0.047 




Pea 


0.047 


0.049 


0.049 


0.047 


0.046 


0.049 


0.053 




Ln 


0.019 


0.022 


0.015 


0.021 


0.028 


0.048 


0.034 




Spe 


0.049 


0.049 


0.051 


0.049 


0.051 


0.049 


0.051 




Ken 


0.045 


0.047 


0.050 


0.049 


0.051 


0.049 


0.050 


Weibull(l,2) 


Hoe 


0.079 


0.065 


0.060 


0.055 


0.059 


0.052 


0.051 




Pea 


0.051 


0.048 


0.049 


0.048 


0.050 


0.054 


0.043 




Ln 


0.016 


0.024 


0.015 


0.021 


0.028 


0.045 


0.036 




Spe 


0.050 


0.051 


0.052 


0.051 


0.051 


0.052 


0.053 




Ken 


0.047 


0.049 


0.051 


0.051 


0.050 


0.050 


0.052 


Student-t(l) 


Hoe 


0.082 


0.065 


0.059 


0.057 


0.057 


0.052 


0.054 




Pea 


0.068 


0.061 


0.053 


0.051 


0.043 


0.029 


0.018 




Ln 


0.019 


0.024 


0.014 


0.020 


0.028 


0.050 


0.034 




Spe 


0.048 


0.049 


0.050 


0.048 


0.052 


0.049 


0.046 




Ken 


0.045 


0.047 


0.051 


0.049 


0.052 


0.048 


0.046 


Student-t(16) 


Hoe 


0.081 


0.063 


0.060 


0.056 


0.057 


0.048 


0.045 




Pea 


0.050 


0.050 


0.050 


0.049 


0.050 


0.051 


0.045 




Ln 


0.020 


0.025 


0.014 


0.022 


0.031 


0.050 


0.033 



Table 4 

Empirical power function at level 0.05, for pairs of i.i.d. random variables. 



the power for level 0.05. 

Tables 6 and 7 show the tendencies of the power function under the mix- 
ture (50-50) of bivariate distributions with standard normal marginal dis- 
tributions, one with a correlation coefficient equal to p and the other with 
a correlation coefficient equal to —p; p taking the values 0.5,0.6,0.7 and 0.9. 
In both tables, the L„ test achieves the higher values in the power function, 
when the sample size grows. 

5. Conclusions. Under the assumption of independence, by construc- 
tion, the Ln test respects the significance level, as showed in the simulation 
study. In contrast, for moderate sample size (between 20 and 100) we re- 
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Fig 6. Sample size vs. empirical power Function at level 0.01 in the case of independent 
Student-t random variables with 1 and 16 degree of fredom for the left and right figure re- 
spectively. Hoeffdmg ("h" tn red); Pearson ("p" tn green); Spearman ("r" in sky); Kendall 
("k" in blue); Ln ("I" in black). 




Fig 7. Sample size vs. empirical power function at level 0.01. Bivariate case, with N{0, 1) 
marginals and p = 0.7. Hoeffding ("h" in red); Pearson ("p" in green); Spearman ("r" in 
sky); Kendall ("k" in blue); L„ ("I" in black). 
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Spearman Kendall Hoeffding Pearson L. 



10 


0.566 


0.555 


0.590 


0.681 


0.215 


20 


0.914 


0.910 


0.893 


0.956 


0.351 


30 


0.988 


0.988 


0.979 


0.996 


0.559 


40 


0.999 


0.999 


0.998 


1.000 


0.618 


50 


1.000 


1.000 


1.000 


1.000 


0.824 


60 


1.000 


1.000 


1.000 


1.000 


0.779 


70 


1.000 


1.000 


1.000 


1.000 


0.877 


80 


1.000 


1.000 


1.000 


1.000 


0.932 


90 


1.000 


1.000 


1.000 


1.000 


0.960 


100 


1.000 


1.000 


1.000 


1.000 


0.973 



Table 5 

Empirical power function at level 0.05. Bivariate case with 7V(0, 1) marginal distributions 

and p = 0.7. 




20 40 60 80 100 



Fig 8. Sample size vs. empirical power function at level 0.01. Mixture (50-50) of two 
bivariate distributions with standard normal marginals, one with p = 0.7 and the other 
with p = —0.7. Hoeffding ("h" m red); Pearson ("p" in green); Spearman ("r" in sky); 
Kendall ("k" in blue); Ln ("I" in black). 

port the lack of control of the power function for Pearson and Hoeffding 
test in the case of heavy tailed marginal distributions, like Weibull, Pareto 
and t- student (with a small degree of freedom). This means that, for small 
n, when we do not have information about the normality of the marginal 
distributions, it is recommended the procedure L„. We enphasize that even 
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p 


Test 


20 


40 


60 


80 


100 


500 


1000 




Spe 


0.007 


0.008 


0.008 


0.008 


0.009 


0.007 


0.006 




Ken 


0.008 


0.009 


0.010 


0.010 


0.010 


0.009 


0.008 


0.5 


Hoe 


0.018 


0.014 


0.012 


0.011 


0.012 


0.014 


0.022 




Pea 


0.016 


0.018 


0.021 


0.020 


0.022 


0.020 


0.021 




Ln 


0.002 


0.004 


0.005 


0.013 


0.025 


0.121 


0.213 




Spe 


0.009 


0.007 


0.007 


0.007 


0.007 


0.006 


0.006 




Ken 


0.010 


0.010 


0.010 


0.010 


0.010 


0.009 


0.008 


0.6 


Hoe 


0.018 


0.013 


0.012 


0.011 


0.013 


0.025 


0.104 




Pea 


0.024 


0.026 


0.025 


0.027 


0.027 


0.028 


0.025 




Ln 


0.003 


0.006 


0.011 


0.030 


0.048 


0.301 


0.528 




Spe 


0.006 


0.006 


0.006 


0.006 


0.007 


0.006 


0.007 




Ken 


0.009 


0.009 


0.009 


0.009 


0.010 


0.009 


0.010 


0.7 


Hoe 


0.017 


0.014 


0.012 


0.012 


0.013 


0.109 


0.865 




Pea 


0.028 


0.030 


0.033 


0.033 


0.033 


0.033 


0.036 




Ln 


0.004 


0.013 


0.024 


0.061 


0.104 


0.655 


0.902 




Spe 


0.006 


0.006 


0.005 


0.005 


0.005 


0.007 


0.005 




Ken 


0.008 


0.009 


0.008 


0.009 


0.009 


0.010 


0.008 


0.8 


Hoe 


0.016 


0.012 


0.013 


0.015 


0.018 


0.889 


1.000 




Pea 


0.037 


0.040 


0.042 


0.041 


0.044 


0.046 


0.046 




Ln 


0.007 


0.029 


0.064 


0.165 


0.251 


0.951 


0.999 




Spe 


0.005 


0.004 


0.005 


0.005 


0.005 


0.005 


0.004 




Ken 


0.007 


0.007 


0.007 


0.007 


0.007 


0.009 


0.007 


0.9 


Hoe 


0.012 


0.014 


0.021 


0.035 


0.058 


1.000 


1.000 




Pea 


0.047 


0.052 


0.054 


0.054 


0.054 


0.053 


0.054 




Ln 


0.017 


0.103 


0.244 


0.501 


0.673 


1.000 


1.000 



Table 6 

Empirical power function at level 0.01. Mixture (50-50) of bivariate distributions with 
N{0, 1) marginals, one with p and the other with —p. 



100 pairs Of i.i.d. Mixture of two bivariate n 



100 paire of i.i.d. Mixture of two bivariate n 



Fig 9. Plot of the sample. Mixture (50-50) of bivariate distributions with N{0,1) 
marginals, one with p and the other with —p. On the left p = 0.5, on the right p = 0.7. 
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p 


Test 


20 


40 


60 


80 


100 


500 


1000 




Spe 


0.042 


0.043 


0.044 


0.042 


0.042 


0.041 


0.040 




Ken 


0.044 


0.049 


0.050 


0.049 


0.050 


0.047 


0.047 


0.5 


Hoe 


0.070 


0.059 


0.054 


0.054 


0.051 


0.074 


0.143 




Pea 


0.071 


0.074 


0.078 


0.075 


0.081 


0.078 


0.078 




Ln 


0.021 


0.026 


0.026 


0.047 


0.073 


0.346 


0.456 




Spe 


0.043 


0.041 


0.040 


0.040 


0.040 


0.037 


0.037 




Ken 


0.048 


0.049 


0.049 


0.050 


0.049 


0.047 


0.049 


0.6 


Hoe 


0.068 


0.057 


0.053 


0.054 


0.054 


0.160 


0.579 




Pea 


0.086 


0.090 


0.088 


0.089 


0.094 


0.093 


0.092 




Ln 


0.024 


0.033 


0.040 


0.088 


0.120 


0.605 


0.782 




Spe 


0.038 


0.036 


0.035 


0.036 


0.038 


0.035 


0.034 




Ken 


0.044 


0.045 


0.048 


0.049 


0.049 


0.048 


0.046 


0.7 


Hoe 


0.063 


0.054 


0.056 


0.061 


0.068 


0.620 


0.999 




Pea 


0.099 


0.102 


0.105 


0.109 


0.110 


0.107 


0.108 




Ln 


0.031 


0.050 


0.076 


0.154 


0.224 


0.881 


0.977 




Spe 


0.036 


0.032 


0.033 


0.032 


0.032 


0.035 


0.034 




Ken 


0.045 


0.045 


0.045 


0.047 


0.045 


0.048 


0.050 


0.8 


Hoe 


0.060 


0.057 


0.067 


0.083 


0.101 


1.000 


1.000 




Pea 


0.117 


0.125 


0.120 


0.121 


0.123 


0.129 


0.128 




Ln 


0.046 


0.099 


0.165 


0.321 


0.435 


0.993 


1.000 




Spe 


0.031 


0.030 


0.030 


0.028 


0.030 


0.031 


0.029 




Ken 


0.039 


0.042 


0.043 


0.043 


0.043 


0.044 


0.040 


0.9 


Hoe 


0.060 


0.079 


0.124 


0.212 


0.357 


1.000 


1.000 




Pea 


0.135 


0.139 


0.139 


0.141 


0.142 


0.140 


0.137 




Ln 


0.093 


0.257 


0.443 


0.702 


0.826 


1.000 


1.000 



Table 7 

Empirical power function at level 0.05. Mixture (50-50) of bivariate distributions with 
N{0, 1) marginals, one with p and the other with —p. 



when the sample size is equal to 1000 our simulations show the lack of con- 
trol of the significance level for Spearman, Kendall, Hoeffding and Pearson 
tests under the assumption of heavy tailed distributions. 
Under the assumption of normality with high correlation coefficient, the 
power function of L„ test grows with the sample size, but the recommended 
procedure is Pearson, as was expected. L„ could be compared with Pearson 
from a sample size equal to 80. Our procedure reports the remarkable be- 
havior of its power function when applied in mixtures of bivariate normal 
distributions. We observed that Pearson can not detect the dependence even 
with a high value of correlation and L„ is recommended in that case. Consid- 
ering the mixture given by 50% of bivariate distribution of standard normal 
marginals with correlation coefficient equal to p and 50% of bivariate dis- 
tribution of standard normal marginals with correlation cocfRcicnt equal to 
—p, we report that Ln shows a growing power function that is much higher 
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when p grows. See for illustration the plots X \sY va two cases, p = 0.5 and 
p = 0.7, figure 9. In those cases the other tests appear less powerful than 

APPENDIX A: BACKGROUND. STATISTICAL TESTS 

The Pearson test, Kendall test. Spearman test and Hoeffding test are tests 
for association between paired samples. Pearson test checks if p = 0, where p 
is the correlation between the two variables X and Y. The test is supported 
by the Student-i statistic and based on Pearson's product moment correla- 
tion coefficient r, which is the correlation between the two variables in the 
sample, t = "_Jr2 has the t distribution with n — 2 degrees of freedom 
when the samples follow the independent normal distribution. 
The Spearmans rank correlation coefficient uses the percentiles of a distribu- 
tion to define the statistic, the formal expression of this coefficient is given by 

= 12 / — F(x)G{y)]dF{x)dG{y). This measure is cheked by the 
Spearmans rank test. {X, Y) is said to be positively quadrant dependent, if 
H{x,y) — F{x)G{y) > Vx,y. So, ps represents an average which measures 
the positive quadrant dependence. Where the average is taken with respect 
to the marginal distributions of X and Y. The sample version of ps is given 
by Ts = E'Liirankixi) - r^){rank{y,) - ^). 

The Kendall Tau is a measure of the condition "total positivity of order 
two" . A pair of random variables {X, Y) with an absolutely continuous dis- 
tribution function H is said to be totally positive of order two if the joint 
density function h{x,y) satisfies h{x2,y2)h{xi,yi) — h{xi,y2)h(x2,yi) > 0, 
whenever xi < X2 and yi < y2- So, the Kendall Tau coefficient defined by 
r = 2 /5oo J-lo[Hx2, y2)h{xi,yi) - h{xi,y2)h{x2,yi)]dxidyidx2dy2 

measures this property. The sample version of r, is defined as the product 
moment correlation of "signs of concordance" , and this is the statistic used 
by the Kendall Tau test. Formally, we define the s function as s{x) = 1 
when X > 0, s{x) = ^ when x = and s{x) = —1 when x < 0. We have, 

■^"■5 = n{n-i) ^ ^ij^j ^(-^^ ~ ^j)^O^i ~ ^O; '^'s explains how well the two se- 
quences follow a monotone order. 

To compute the P-values for each method, we use the "cor.test" function, 
available in the "stat" package from R-project. More details about each test 
may be found in Hollander et al. [8]. 

The last method is based on the Hoeffdings measure, proposed by Hoeffding 
[7] and supported by the notion of distance between two distributions. For- 
mally, the measure is given by A = / J[H{x, y) — F{x)G{y)]'^dH{x, y). This 
measure is appropriate only when H is absolutely continuous. The sample 
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measure A„ is defined by A„ = A — 2{n — 2)B + {n — 2){n — 3)C ^"^f ^' where 

2 

A = ''^{rank{xi) — l){rank{xi) — 2){rank{yi) — l){rank{yi) — 2), 

i=l 
n 

B = ^(ranA;(xi) — 2){rank{yi) — 2)Ti, 

i=l 
n 

C = ^ Ti{Ti - 1), Ti = {j : Xj < Xi and yj < . 
i=l 

In this last case, to compute the P-values, we use the "hoefFd" function, 
available in the "Hmisc" package from R-project. 
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